Antibodies specific for seroreactive regions on HPV 16 protein E1

ABSTRACT

The invention relates to seroreactive regions on proteins E1 and E2 of human papillomavirus (HPV) 16. The application also relates to a vaccine which contains such peptides which contain the seroreactive regions. The invention likewise embraces compositions for diagnostic purposes which contain peptides with the seroreactive regions.

CROSS-REFERENCE TO RELATED APPLICATIONS

This application is a divisional of U.S. application Ser. No. 09/758,318, filed Jan. 12, 2001, now U.S. Pat. No. 6,531,127; which is a divisional of U.S. application Ser. No. 08/468,337, filed Jun. 6, 1995, now U.S. Pat. No. 6,221,577; which is a divisional of U.S. application Ser. No. 08/237,418, filed May 3, 1994, now U.S. Pat. No. 5,601,973; which is a continuation of application Ser. No. 07/913,613, filed Jul. 16, 1992, now abandoned. Applicants rely on the disclosures, and claim the benefit of the filing date, of all of these applications and patents.

BACKGROUND OF THE INVENTION

The invention relates to seroreactive regions on proteins E1 and E2 of human papillomavirus (HPV) 16.

The application also relates to a vaccine which contains such peptides which contain the seroreactive regions.

The invention likewise embraces compositions for diagnostic purposes which contain peptides with the seroreactive regions.

HPV 16 is one of the human papillomaviruses (Proc. Natl. Acad. Sci., USA 80, 3813-3815 (1983). The organization of the genome of HPV 16 has been described in Virology, 145,181-185 (1985).

Genomic sequences of HPV can be detected in most cases of preinvasive and invasive cervical tumors. HPV 16 has been identified world-wide as the virus type predominating in these tumors. The HPV 16 genome is detectable in more than 50% of cervical tumors, in which case it is often present integrated into the cellular DNA. Little is known about the immune response after infections with HPV 16 or other papillomaviruses.

DESCRIPTION OF THE INVENTION

Initial data: patients suffering from cervical tumors were compared with healthy individuals with regard to the presence of antibodies against viral proteins. These viral proteins were then linked as fusion products with various prokaryotic peptides at their N terminus and then used as antigens in Western blots.

The object of the present invention is the further identification of HPV 16 viral structures which can be used as tool in the prophylaxis, diagnosis and therapy of HPV 16-dependent tumorous diseases in humans. The identification of such structures is a prerequisite for the development of ELISAs which make it possible to test a large quantity of human sera for the presence of HPV 16.

The present invention therefore embraces seroreactive 5 regions of the E1 protein of HPV 16, which have one of the following amino-acid sequences:

I. NGWFYVEAVVEKKTGDAISDDENENDSDTGEDLVDFIVNDNDYLT; (SEQ ID NO:1) II. NENDSDTGEDLVDFIVND; (SEQ ID NO:2) III. MADPAGTNGEEGTGCNGWFYVEAVVEKKTGDAISDDENENDSDTGEDLVDFIV (SEQ ID NO:3) NDNDYLT; IV. EDLVDFIVNDNDYLT; (SEQ ID NO:4) V. EDLVDFIVNDNDYLTQAETETAHALFTAQEAKQH; (SEQ ID NO:5) VI. NENDSDTGEDLVDFIVNDNDYLTQAETETAHALFTAQEAKQHR (SEQ ID NO:6) DAVQVLKRKYL VII. GSPLSDIS; (SEQ ID NO:7)

seroreactive regions of the E2 protein of HPV 16, which have one of the following amino-acid sequences;

(SEQ ID NO:8) I. DKILTHYENDS; (SEQ ID NO:9) II. DKILTHYENDSTDLRDHI; (SEQ ID NO:10) III. DLRDHIDYWKH; (SEQ ID NO:11) IV. AIYYKAREMGFKHINHQVVPTLA; (SEQ ID NO:12) V. AIYYKAREMGFKHINHQVVPTLAVSKNKAL (SEQ ID NO:13) VI. YYKAREMGFKHINHQVVPTLAVSKN; (SEQ ID NO:14) VII. INHQVVPTLAVSKNKALQAI; (SEQ ID NO:15) VIII. INHQVVPTLAVSKNKAL; (SEQ ID NO:16) IX. LAVSKNKALQAIE1QLTLETIYNSQYSNEKWTLQDV; (SEQ ID NO:17) X. QLTLETIYNSQYSNEKWTLQDVSLE; (SEQ ID NO:18) XI. TLETIYNSQYSNEK; (SEQ ID NO:19) XII. TSVFSSNEVSSPEII; (SEQ ID NO:20) XIII. VFSSNEVSSPEIIRQHLANHPAATHTKAVALGTEET; (SEQ ID NO:21) XIV. EIIRQHLANHPAATHTKAVALGTEETQTTIQRPRSEP; and (SEQ ID NO:22) XV. TEETQTTIQRPRSEPDTGN.

The invention furthermore embraces peptides with one or more of the seroreactive regions identified above, a vaccine which contains one or more of the peptides identified above, a composition for diagnostic purposes for the identification of specific antibodies against HPV E1 and/or E2 protein, which likewise contains the peptides identified above, monoclonal antibodies which have an affinity for one or more of the seroreactive regions of the E1 or E2 protein of HPV 16, and a composition for diagnostic purposes which contains these monoclonal antibodies.

In order to identify seroreactive regions in proteins E1 and E2 of HPV, the experimental route described in Science 228, 1315-1317 (1985) was followed. Subgenomic HPV 16 DNA fragments which had been randomly generated by ultrasound treatment and partial DNAse I treatment were cloned into the phage vector fusE1 and then expressed as part of a phage coat protein. Seroreactive phage recombinants were identified using sera prepared against E1 and E2, and purified, and the seroreactive regions were characterized by sequencing the HPV 16 portion. Polyclonal rabbit sera against an HPV 16 E1 MS2 polymerase fusion protein and against the amino- and carboxyl-terminal part of HPV 16 E2 (separate, likewise ) MS2 fusion proteins) were prepared.

The filamentous phages embrace the three groups f1, fd, and M13. It is common to them all that binding and uptake of the phages takes place via F pili of the bacteria, i.e. that only F⁺ strains can be infected. The fd wild-type phage, from which the vector system used is derived, forms particles which are about 900×6 nm in size and which are composed in particular of about 2700 subunits of the main coat protein. In addition, in each case, 5 molecules of the minor coat proteins pIII, pVI, pVII, and pIX are located at both ends of the virions. The single-stranded, circular phage genome which, in the case of the fd wild-type, is 6408 bp in size, carries the information for a total of 10 different proteins.

In the fd derivatives fuse1, fuse2 (Parmley and Smith, Gene, 7, 305-318 (1988)), and fusemm, a tetracycline-resistance gene is integrated, by insertion of a part of the Tn10 transposon, in the phage genome, which has been enlarged to about 9.2 kbp in this way. This means that the replicative DNA double-stranded phage genomes behave in the bacteria like selectable plasmids and can accordingly be prepared and used for clonings. Another modification from the wild-type is the presence of a reading frame mutation in the gene for the minor coat protein pIII in conjunction with an inserted restriction site for cloning expressable DNA fragments. The gene for pIII is composed of two almost completely independent domains (Crissmann and Smith, 1984): an N-terminal domain which mediates the binding of the phages to the bacterial cell receptor (F pili), and a C-terminal protein domain which is responsible for phage morphogenesis. The reading frame mutation, which is located directly behind the signal sequence of the protein, thus leads to inactivation of the gene and accordingly also prevents the formation of infectious particles. This is of importance for the replication of these phage mutants as plasmids because the fd genomes inactivated in the morphogenesis do not damage the host bacteria (Smith, in: Vectors, A Survey of Molecular Cloning Vectors and Their Uses, Butterworth Publishers, Stoneham, Mass. 61-85, 1987).

Insertion of suitable DNA fragments and restoration of gene III functions lead to the formation of infectious phage particles which carry additional amino-acid sequences on their coats. These sequences are accessible to various ligands, for example antibodies, in the natural state of the phages.

The fd expression system used in this invention is essentially based on setting up phage banks by cloning DNA foreign sequences into the gene III, and examining the latter with the aid of monoclonal or polyclonal sera for seroreactive recombinants. An amplification normally takes place in preparation of these expression banks. The extent of this replication of individual clones in turn depends on the nature and size of the inserted DNA sequence. This means that different clones differ in frequency, which may differ by up to several powers of ten. It is therefore possible to derive from the stated properties the following two features of the fd expression banks:

Amplification of the banks, which leads to repeated cloning of identical phage clones isolated by immunoscreening;

Possibility of enriching seroreactive phages by affinity chromatography (columns) because phages in the active state can be bound and eluted again.

The repeated isolation of identical recombinants was avoided by using, separately set up banks, there being an extremely low probability of cloning a DNA fragment prepared identically and in parallel, or of the phage recombinant derived therefrom.

In this invention, a total of 11 different expression banks for HPV 16 DNA in fusE1 were set up. The number of primary, tetracycline-resistant, and insert-harboring recombinants was, in this case, between 2,000 and 90,000 per bank. Since complete plasmids composed of about 4 kb vector portion and 8 kb HPV portion in sheared form were always used for the cloning, the HPV-containing fd recombinants are reduced by about 30%. The fragments cloned in were then expressed, as already mentioned, as a fusion protein of the gene III coat protein. The cloning site in gene III is in this case directly behind the translated signal sequence for protein export. In order to restore the function of the gene, it is necessary for an insert to have a defined size (3n+2; n=0, 1, 2, 3 . . . ). In order to express a defined protein sequence as fusion protein of the gene III product, it is necessary in addition for both the 3′ and the 5′ junction to be in the correct reading frame, and for the corresponding insert to be present in the correct orientation. This therefore applies overall to only about every 18th (3×3×2) HPV DNA-containing recombinant. A small portion thereof is in turn inactivated by translation stop codons present in the insert, or by proteins which are not functional because of their folding. Because of the stated parameters it is difficult to estimate what is the minimum number of recombinants necessary to express with great probability any required part of an HPV genome as fd fusion protein in the phage bank. In papillomaviruses, about 10 kb of the genome (partly by overlapping open reading frames) code for proteins. Of 2,000 tetracycline-resistant insert-harboring recombinants, about 100 ({fraction (1/18)}) clones express HPV sequences in a suitable manner. With average HPV fragment sizes of 50-150 bp, the expressed HPV sequence amounts to about 5,000-15,000 bp. In fact, fd banks with about 2,000 recombinants prove to be sufficiently representative.

In order to ensure the specificity of the immuno-screening, either several different recombinants of a seroreactive region or at least several identical but independently isolated phage recombinants were always isolated.

The amino-acid position indicators in FIGS. 2 and 5 relate to the E1 and E2 proteins, and not to the positions of the open reading frames. The first methionine was given position 1.

EXAMPLES Example 1

Preparation of Polyclonal Antisera Against HPV 16 E1

In order to isolate seroreactive phage recombinants from the HPV 16 fd expression bank, initially polyclonal rabbit sera against HPV 16 E1 MS2 fusion proteins were prepared. For this, the Pst I A fragment of HPV 16 (bp 875-3693) was cloned into the Pst I cleavage site of the expression vector pEX12mer (Seedorf et al., EMBO J. 6, 139-144, 1987), by which amino acids 5-649 of HPV 16 E1 ORF are expressed (FIG. 1). This vector is a derivative of the plasmid pPLC24 (Remaut et al., Gene 15, 81-93,1981) which has been modified by insertion of the pUC8 polylinker behind the MS2 polymerase portion. The fusion protein is transcribed in the pEX12mer by the temperature-inducible lambda pL promoter. The N-terminal fusion portion of the MS2 protein amounts to 100 amino acids.

Since the original HPV 16 isolate (Seedorf et al., Virology, 145, 181-185, 1985) has a reading frame mutation in the region of the E1 open reading frame (nucleotide position 1138), recourse was had to an HPV 16 isolate from a cervical carcinoma with a complete E1 ORF. Because of the selected restriction cleavages, the HPV 16 E1 open reading frame (bp 865-2811) is thus completely expressed apart from three N-terminal amino acids.

The clonings and plasmid analyses were initially carried out using the E. coli strain W6 in which there is constitutive expression of the repressor for the lambda promoter. This prevented expression of the fusion proteins, in order to prevent counterselection after the transformations. After examination of the cloning by restriction analysis, and Southern blot hybridization with radioactive labelled HPV 16 DNA (Pst I A fragment), the plasmid DNA of the construct was used for transformation in E. coli N6045. This strain is able, because of its temperature-sensitive repressor of the lambda promoter, to express the MS2 fusion proteins.

It was then possible in a Western blot to examine, with the aid of a monoclonal antibody directed against the MS2 portion of the fusion protein, by comparison of extracts from induced and non-induced bacteria the size and the expression rate of the fusion protein. Since the band of the MS2 E1 fusion protein corresponded to the expected size of about 90 kD, no examination of the cloning junctions by sequencing was carried out. In the two other reading frames of the HPV 16 Pst I A fragment, expression of larger proteins is impossible because of the presence of translation stop codons. In addition, both Pst I cleavage sites of the vector-insert junctions were retained. Correct expression of the E1 open reading frame was confirmed by the results of the inununoscreening of the HPV 16 fd expression banks, which are described in the following section.

The MS2-E1 fusion protein was then purified from induced E. coli cultures by differential extraction and by electroelution from SDS polyacrylamide gels, and was then used to immunize two rabbits.

Example 2

Identification of Seroreactive Regions on the HPV 16 E1 Protein

Both of the polyclonal rabbit sera prepared against HPV 16 E1 were used to examine five different HPV 16 fd expression banks for reactive recombinants. It was possible in this way to identify a total of at least two different antibody binding sites represented by non-overlapping phage clones. In total, 19 independent phage clones which contain seven different classes of HPV 16 inserts were isolated (FIG. 2). Six classes have a common overlapping region which codes for the HPV 16 E1 specific peptide EDLVDFIVND (SEQ ID NO: 23). The second identified epitope on the E1 protein is represented by a recombinant phage (clone 1059) which codes for the E1 peptide GSPLSDIS (SEQ ID NO: 7).

The original HPV 16 isolate has a reading frame mutation in the E1 open reading frame (nucleotide position 1138). The DNA of this HPV 16 isolate was used to prepare the fd expression banks. Two of the isolated seroreactive fd recombinants contain this region and therefore also have the reading frame mutation. In clone 1145 this leads to a change of reading frame, and this results in C-terminal attachment of three HPV 16-E2 non-specific amino acids ( . . . ValValHis). Clone 1059 starts in the wrong frame and is converted into the correct HPV 16 E1 reading frame by the reading frame mutation of the HPV 16 isolate used. The clone codes for the peptide STGSKTKVFGSPLSDIS (SEQ ID NO: 24), of which only the C-terminal amino acids . . . GSPLSDIS (SEQ ID NO: 7) derive from the actual HPV 16 E1 protein and must form the epitope.

Both clones which contain the reading frame mutation have the correct insert size (3n+2 base pairs) to restore the reading frame of gene III of the phage vector.

Example 3

Preparation of Polyclonal Antisera Against HPV 16 E2

Like the case of the HPV 16 E1 open reading frame, no suitable antisera were available for the HPV 16 E2 protein either. For this reason, the HPV 16 E2 open reading frame (nucleotide position 2756-3850; AA 1-365) was expressed in the vector pEX12mer as already described for the E1 protein.

Firstly the HPV 16 DNA fragment was cloned via the Hinf I cleavage site at position 2761 into the pEX12mer vector. In this case the starting material was an already sub-cloned HPV 16 fragment (bp 2367-4467). This fragment was cut out of the vector again, via the additionally inserted non-HPV 16-specific restriction sites Xba I (5′ end) and Bam HI (3′ end), and prepared. This DNA fragment, which is 2.1 kb in size (Xba II/Bam HI), was then partially cut with Hinf I. This results, among other things, in a fragment which is 1700 hp in size between the 3′-terminal Bam HI cleavage site and Hinf I site at bp 2761. The internal Hinf I cleavage site (bp 3539) in this fragment is uncleaved, and the HPV 16 E2 ORF is completely present apart from three amino-terminal amino acids. After preparation, the Hinf/Bam fragment was cloned into the pEX12mer expression vector which had been cleaved with Bam HI. This resulted, via the compatible Bam HI sites, in linear products of vector and insert. The free ends of these products were filled in with Klenow polymerase and then closed by ligation. This results in an MS2-E2 junction at the filled-in cleavage sites Bam HI (vector) and Hinf I (E2 insert) with loss of the two restriction sites. Using Eco RI/Bam HI double restriction cleavages, it was possible to identify recombinants which harbored the HPV 16 E2 fragment in the correct orientation.

After transformation into the E. coli expression strain 6045 it was not possible, using a monoclonal antibody directed against the MS2 polymerase, to find any production whatever of the MS2 fusion protein. In order to rule out a displacement of the reading frame at the MS2-E2 junction, the plasmid DNA of a total of 16 different MS2-E2 recombinants was hybridized in a Southern blot with an oligonucleotide derived from the correct Bam HI/Hinf I junction. Since an unambiguous hybridization signal was identifiable with 15 clones, it was assumed that the cloning had taken place in the correct reading frame, and expression of the complete E2 ORF is not possible in pEX vectors. As a substitute, the HPV 16 E2 protein was then expressed in two halves in the pEX12mer vector.

Example 4

Expression of the Amino-terminal Region of HPV 16 E2

The amino-terminal region of the E2 open reading frame between nucleotide position 2761 and 3209 was cloned into the pEX12mer vector and expressed. Since the E2 open reading frame starts at nucleotide position 2756, the MS2-E2 fusion protein lacks the first two amino acids (Met-Glu) of the E2 protein (FIG. 4).

Plasmid DNA composed of pEX12mer and HPV 16 E2, which were obtained from the cloning described above, was truncated at the carboxyl terminus (i.e., 3′ end) by deletion of a Hinc II (HPV 16 bp 3209)/Bam HI fragment and religation (blunt/flush from Hinc II and Bam HI). This results in expression of the N-terminal part of HPV 16 E2 between nucleotide position 2761 (Hinf I) and 3209 (Hinc II). A fusion protein about 30 kD in size was detectable in induced bacteria in a Western blot with an anti-MS2 molecule antibody.

The fusion protein was purified by differential extraction of the induced bacterial lysate and by electroelution of the protein band from SDS polyacrylamide gels stained with Coomassie blue, and used for immunizing rabbits.

Example 5

Expression of the Carboxyl-terminal Region of HPV 16 E2

The C-terminal region of the HPV 16 E2 open reading frame between nucleotide position 3209 and 3850 was expressed in the pEX12mer vector (FIG. 3). The region is thus directly connected to the expressed amino-terminal part, described above, of the HPV16 E2 open reading frame.

For this, recourse was had to the Xba/Bam fragment which has been described above and which contains the complete HPV 16 E2 reading frame. After restriction cleavage, a Hinc II/Bam HI fragment (nucleotide position 3209-4467) which contains the carboxyl-terminal half of HPV 16 E2 was isolated. This fragment was inserted into the Bam HI cleavage site of the pEX12mer expression vector (5′Bam HI/Hinc II-Bam HI/Bam HI 3′). It was possible with the aid of the anti-MS2 monoclonal antibody to identify in extracts of induced bacteria a fusion protein of about 30 kD, which was purified by differential extraction and electroelution from SDS polyacrylamide gels, and was used to immunize rabbits.

Example 6

Identification of Seroreactive Regions on the HPV 16 E2 Protein

Available for the immunoscreening of the fd HPV 16 expression banks was a total of four different anti-HPV 16 E2 antisera: in each case two sera against the amino-terminal part (bp 2761-3209; AA 3-152) and two against the carboxyl-terminal part of the E2 (bp 3209-3850; AA 153-365) open reading frame. These sera were used to examine five different expression banks for seroreactive recombinants. This resulted in isolation of a total of 32 clones, of which 26 contain amino-terminal sequences of the E2 protein. These 26 clones form a total of 11 different classes which represent four different non-overlapping regions (FIG. 5).

All the epitopes are located in a restricted region comprising 88 amino acids of the amino terminus of the E2 open reading frame which is located between nucleotide position 2792 (AspLyslle . . . ) and 3055 ( . . . SerLeuGlu).

It was possible to locate in the carboxyl-terminal region at least two independent non-overlapping epitopes (TSVFSSNEVSSPEII (SEQ ID NO: 19) and TEETQTTIQRPRISEPDTGN (SEQ ID NO: 22), FIG. 5). These are represented by a total of four classes of recombinants with six independent isolates. The region of the E2 open reading frame which is covered by the clones is located between nucleotide position 3343 (ThrSerVal . . . ) and 3502 ( . . . ThrGlyAsn) and comprises 52 amino acids.

Five classes of recombinants (12 isolates) extend over nucleotide position 2926. All the clones have a point mutation (A→G transition) here, but this does not lead to a change in the corresponding amino acid (glutamine).

Example 7

Immunoscreening of fd Phase Expression Banks

1. Phage Affinity Concentration with Protein A-Sepharose Columns

The phage banks prepared in the fd phage expression system used unavoidably underwent amplification on cloning. The extent of this replication of the original clones is in turn greatly influenced by the nature of the individual recombinants, for example by different sizes of inserts or conformation of the coat proteins, inhibition of physiological processes in the infected bacteria and many others, and it was therefore not to be expected that uniform amplification of all phages takes place. In order to isolate underrepresented phage recombinants or clones from large libraries, seroreactive phage recombinants were concentrated. For this, use was made of the circumstance that the foreign sequences expressed in each case appear as part of an fd gene III fusion protein on the coat of natural phage particles. Large amounts of phages (10⁹-10¹² particles) were for this purpose bound to protein A antibody columns and eluted again.

For this, initially protein A-Sepharose was swollen with PBS for 30 min and was washed with PBS. Subsequently the protein A-Sepharose was incubated with about 1 to 2 ml of suitable polyclonal sera (rabbit or human) or with corresponding protein A-binding monoclonal antibodies in Eppendorf reaction tubes on a rotary shaker at 4° C. for 1 to 2 days. Subsequently, the protein A-Sepharose was washed 10 times by the Sepharose, being alternately resuspended in 10 ml of PBS and pelleted again by centrifugation (2 min, 6,000 rpm). The protein A-Sepharose-IgG complexes formed were then incubated with an appropriate amount of phages as above. Then the Sepharose was washed with PBS several times as before and packed into a Pasteur pipette closed with a glass bead and washed with several liters (2-15 I) flowing through. The column material was removed and then incubated in the same volume of elution buffer (1 mg/ml BSA, 0.1 M HCl, glycine, pH 2.2) for 15 min. After brief centrifugation, the supernatant, which now contains free phages and antibodies, was neutralized with ⅕ of the volume of tris base (0.5 M). Antibodies which recognize the re-combinant gene are able to inhibit binding of the phage to the bacterial cells and thus the cycle of infection. For this reason the phages were added in 100-200 μl aliquots of the eluates immediately after neutralization to exponentially growing E. coli K91 and plated out on complete medium plates. It emerged during the work that replica filters of these phage platings were unsuitable for immunoblotting, probably because of contaminants in the eluate. For this reason the resulting plaques were again rinsed off the plates with complete medium and subsequently plated out from the phage suspensions obtained in this way, which had undergone renewed amplification, and the immunoblotting was carried out on minimal agar plates.

2. Phage Platings and Preparation of Nitrocellulose Replica Filters for the Immunoblotting

All the fd phage derivatives were plated out on a lawn formed by E. coli K91 (Lyons and Zinder, Virology, 49, 45-60, 1972). This strain is distinguished by a large number of F pili (5 per cell, compared with about 0.5 per cell in most F⁺ strains) which are responsible for uptake of filamentous phages. This is particularly important for the fd expression system used in this study because the recombinant fuse phages have, owing to the uptake of a part of the Tn10 transposon (tetracycline resistance), a genome which is distinctly enlarged compared with the wild-type, and for this reason form particularly small plaques.

To plate out the phages, a R91 overnight culture was diluted 1:100 in complete medium (2×YT) and incubated at 37° C. for 3 to 4 h. After a density of E₆₀₀=0.8-1.2 was reached, 200 μl of the bacteria were plated out with an appropriate amount of phages, together with 3.5 ml of agarose (0.6% agarose, 10 mM MgSO₄, 50° C.) on prewarmed bacteria plates. Minimal agar plates were always used for every plating intended to be used for nitrocellulose replicas for the immunoscreening. Platings out for determination of the phage titer or for DNA hybridization were carried out on complete medium plates.

Use of complete medium plates for the immunoblotting always lead to very high non-specific reactivity of the filters with the sera used.

The plates were incubated at 37° C. overnight. After about 16 hours, a nitrocellulose filter was placed on for 10-15 min, marked with four asymmetric pricks with a needle, and removed again using flat-ended forceps. The filters were labelled and then inverted onto a fresh minimal agar plate and incubated further at 37° C. for 5-6 hours. This increased the amount of phage particles (proteins) on the filter since the reincubation makes it possible for the bacteria and phages bound to the filters to grow further via the nutrients diffusing from the plate. Subsequently the filters were removed and saturated in 10% milk (skim milk powder in PBS) for 30-60 min. The filters were then incubated with suitable dilutions of appropriate sera in 5% milk at 4° C. overnight.

3. Immunostaining of Replica Filters and Cloning of Reactive Recombinants

After removal of the replica filters, blocking in 10% milk (in PBS) for 60 min and overnight incubation with antisera, the nitrocellulose filters were washed in PBS, 0.05% Tween 20 (5 changes of washing buffer) for 30 min. The filters were then incubated with 1:1000 dilutions of appropriate second antibodies (peroxidase-coupled goat anti-human, anti-rabbit, or anti-mouse) in 5% milk at RT for 2 h. This was followed by renewed washing (see above) and incubation in the following staining mixture:

40 mg of diaminobenzidine 30 μl of 30% H₂O₂ 1.5 ml of 1% NiSO₄ in 50 ml of PBS.

After sufficient color had developed, the filters were removed from the solution, placed in water for 30 min, and then dried on 3 MM paper.

The prick holes and signals on the filters were then copied onto a sheet or the lid of a bacteria dish. This made it possible to assign a position or, if the phage dilution was sufficiently large (round 2 or higher), a plaque to a signal. A sterile toothpick was gently stabbed into the position or the plaque, and the toothpick was placed in 500 μl of complete medium for 10-15 min. This phage suspension then contained generally about 10⁶-10⁷ infectious particles, which comprises about 0.1-1% of the phages in a plaque. The phage suspensions were then incubated at 65° C. for 15-20 min in order to kill bacteria which had been carried over, and were then 5 stored at 4° C.

DESCRIPTION OF THE FIGURES

FIG. 1

Cloning of the E1 open reading frame into the expression vector pEX12mer. The figure includes DNA sequences SEQ ID NO:25, SEQ ID NO: 27, SEQ ID NO:29, SEQ ID NO:31, and amino acid sequences SEQ ID NO: 26, SEQ ID NO:28, SEQ ID NO:30, and SEQ ID NO:32.

FIG. 2

Seroreactive regions on the HPV 16 E1 protein. The figure includes amino acid sequences SEQ ID NO:1, SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, and SEQ ID NO:24. Small letters indicate the amino acids of clones 1145 and 1059 which, because of the change in reading frame of the HPV 16 isolate used for cloning the fd banks, are not derived from the HPV 16 E1 protein (see text). Clones 1090, 1079, 1084, 1029, 1099, and 1145 have a common region of 10 amino acids (EDLVDFIVND; SEQ ID NO:23) which possibly represents a common epitope of the clones, although other antibody binding sites on these clones cannot be ruled out. Clone 1059 has, because of the change in reading frame, no common amino-acid sequences with the other clones, although the insert of this clone overlaps with the insert of clone 1145. The position indications relate to the HPV 16 E1 open reading frame. The amino acids of clones 1145 and 1059, which do not derive from E1, are not taken into account here.

FIG. 3

Cloning of the carboxyl-terminal half of the HPV 16 E2 protein into the expression vector pEX12mer. The figure includes DNA sequences SEQ ID NO:25, SEQ ID NO:33, SEQ ID NO:35, SEQ ID NO:37, SEQ ID NO:39, and amino acid sequences SEQ ID NO:26, SEQ ID NO:34, SEQ ID NO:36, SEQ ID NO:38, and SEQ ID NO:40.

FIG. 4

Cloning of the amino-terminal half of the HPV 16 E2 protein into the expression vector pEX12mer. The figure includes DNA sequences SEQ ID NO:25, SEQ ID NO:33, SEQ ID NO:35, SEQ ID NO:41, SEQ ID NO:43 and amino acid sequences SEQ ID NO:26, SEQ ID NO:34, SEQ ID NO:36, SEQ ID NO:42, and SEQ ID NO:44.

FIG. 5

Seroreactive regions on the HPV 16 E2 protein. The figure includes amino acid sequences SEQ ID NO:8, SEQ ID NO:9, SEQ D NO:10, SEQ ID NO:11, SEQ ID NO:12, SEQ ID NO:13, SEQ ID NO:14, SEQ ID NO:15, SEQ ID NO:16, SEQ ID NO:17, SEQ ID NO:18, SEQ ID NO:19, SEQ ID NO:20, SEQ ID NO:21, and SEQ ID NO:22. The regions (E2-1066, -1170, -1074, -1112) on the carboxyl-terminal half of HPV 16 E2 are all located in a region 88 amino-acids long (between AA 13 and 100) and partially overlap. The carboxyl-terminal regions are also closely adjacent (between AA 197 and 249). The two regions are in each case arranged approximately proportional to their position on the E2 protein.

                   #             SEQUENCE LISTING (1) GENERAL INFORMATION:    (iii) NUMBER OF SEQUENCES: 44 (2) INFORMATION FOR SEQ ID NO: 1:      (i) SEQUENCE CHARACTERISTICS:           (A) LENGTH: 45 amino  #acids           (B) TYPE: amino acid           (D) TOPOLOGY: linear     (ii) MOLECULE TYPE: peptide     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:  #1: Asn Gly Trp Phe Tyr Val Glu Ala Val Val Gl #u Lys Lys Thr Gly Asp 1               5    #                10   #                15 Ala Ile Ser Asp Asp Glu Asn Glu Asn Asp Se #r Asp Thr Gly Glu Asp             20       #            25       #            30 Leu Val Asp Phe Ile Val Asn Asp Asn Asp Ty #r Leu Thr         35           #        40           #        45 (2) INFORMATION FOR SEQ ID NO: 2:      (i) SEQUENCE CHARACTERISTICS:           (A) LENGTH: 18 amino  #acids           (B) TYPE: amino acid           (D) TOPOLOGY: linear     (ii) MOLECULE TYPE: peptide     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:  #2: Asn Glu Asn Asp Ser Asp Thr Gly Glu Asp Le #u Val Asp Phe Ile Val 1               5    #                10   #                15 Asn Asp (2) INFORMATION FOR SEQ ID NO: 3:      (i) SEQUENCE CHARACTERISTICS:           (A) LENGTH: 60 amino  #acids           (B) TYPE: amino acid           (D) TOPOLOGY: linear     (ii) MOLECULE TYPE: peptide     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:  #3: Met Ala Asp Pro Ala Gly Thr Asn Gly Glu Gl #u Gly Thr Gly Cys Asn 1               5    #                10   #                15 Gly Trp Phe Tyr Val Glu Ala Val Val Glu Ly #s Lys Thr Gly Asp Ala             20       #            25       #            30 Ile Ser Asp Asp Glu Asn Glu Asn Asp Ser As #p Thr Gly Glu Asp Leu         35           #        40           #        45 Val Asp Phe Ile Val Asn Asp Asn Asp Tyr Le #u Thr     50               #    55               #    60 (2) INFORMATION FOR SEQ ID NO: 4:      (i) SEQUENCE CHARACTERISTICS:           (A) LENGTH: 15 amino  #acids           (B) TYPE: amino acid           (D) TOPOLOGY: linear     (ii) MOLECULE TYPE: peptide     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:  #4: Glu Asp Leu Val Asp Phe Ile Val Asn Asp As #n Asp Tyr Leu Thr 1               5    #                10   #                15 (2) INFORMATION FOR SEQ ID NO: 5:      (i) SEQUENCE CHARACTERISTICS:           (A) LENGTH: 34 amino  #acids           (B) TYPE: amino acid           (D) TOPOLOGY: linear     (ii) MOLECULE TYPE: peptide     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:  #5: Glu Asp Leu Val Asp Phe Ile Val Asn Asp As #n Asp Tyr Leu Thr Gln 1               5    #                10   #                15 Ala Glu Thr Glu Thr Ala His Ala Leu Phe Th #r Ala Gln Glu Ala Lys             20       #            25       #            30 Gln His (2) INFORMATION FOR SEQ ID NO: 6:      (i) SEQUENCE CHARACTERISTICS:           (A) LENGTH: 54 amino  #acids           (B) TYPE: amino acid           (D) TOPOLOGY: linear     (ii) MOLECULE TYPE: peptide     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:  #6: Asn Glu Asn Asp Ser Asp Thr Gly Glu Asp Le #u Val Asp Phe Ile Val 1               5    #                10   #                15 Asn Asp Asn Asp Tyr Leu Thr Gln Ala Glu Th #r Glu Thr Ala His Ala             20       #            25       #            30 Leu Phe Thr Ala Gln Glu Ala Lys Gln His Ar #g Asp Ala Val Gln Val         35           #        40           #        45 Leu Lys Arg Lys Tyr Leu     50 (2) INFORMATION FOR SEQ ID NO: 7:      (i) SEQUENCE CHARACTERISTICS:           (A) LENGTH: 8 amino  #acids           (B) TYPE: amino acid           (D) TOPOLOGY: linear     (ii) MOLECULE TYPE: peptide     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:  #7: Gly Ser Pro Leu Ser Asp Ile Ser 1               5 (2) INFORMATION FOR SEQ ID NO: 8:      (i) SEQUENCE CHARACTERISTICS:           (A) LENGTH: 11 amino  #acids           (B) TYPE: amino acid           (D) TOPOLOGY: linear     (ii) MOLECULE TYPE: peptide     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:  #8: Asp Lys Ile Leu Thr His Tyr Glu Asn Asp Se #r 1               5    #                10 (2) INFORMATION FOR SEQ ID NO: 9:      (i) SEQUENCE CHARACTERISTICS:           (A) LENGTH: 18 amino  #acids           (B) TYPE: amino acid           (D) TOPOLOGY: linear     (ii) MOLECULE TYPE: peptide     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:  #9: Asp Lys Ile Leu Thr His Tyr Glu Asn Asp Se #r Thr Asp Leu Arg Asp 1               5    #                10   #                15 His Ile (2) INFORMATION FOR SEQ ID NO: 10:      (i) SEQUENCE CHARACTERISTICS:           (A) LENGTH: 11 amino  #acids           (B) TYPE: amino acid           (D) TOPOLOGY: linear     (ii) MOLECULE TYPE: peptide     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:  #10: Asp Leu Arg Asp His Ile Asp Tyr Trp Lys Hi #s 1               5    #                10 (2) INFORMATION FOR SEQ ID NO: 11:      (i) SEQUENCE CHARACTERISTICS:           (A) LENGTH: 23 amino  #acids           (B) TYPE: amino acid           (D) TOPOLOGY: linear     (ii) MOLECULE TYPE: peptide     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:  #11: Ala Ile Tyr Tyr Lys Ala Arg Glu Met Gly Ph #e Lys His Ile Asn His 1               5    #                10   #                15 Gln Val Val Pro Thr Leu Ala             20 (2) INFORMATION FOR SEQ ID NO: 12:      (i) SEQUENCE CHARACTERISTICS:           (A) LENGTH: 30 amino  #acids           (B) TYPE: amino acid           (D) TOPOLOGY: linear     (ii) MOLECULE TYPE: peptide     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:  #12: Ala Ile Tyr Tyr Lys Ala Arg Glu Met Gly Ph #e Lys His Ile Asn His 1               5    #                10   #                15 Gln Val Val Pro Thr Leu Ala Val Ser Lys As #n Lys Ala Leu             20       #            25       #            30 (2) INFORMATION FOR SEQ ID NO: 13:      (i) SEQUENCE CHARACTERISTICS:           (A) LENGTH: 25 amino  #acids           (B) TYPE: amino acid           (D) TOPOLOGY: linear     (ii) MOLECULE TYPE: peptide     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:  #13: Tyr Tyr Lys Ala Arg Glu Met Gly Phe Lys Hi #s Ile Asn His Gln Val 1               5    #                10   #                15 Val Pro Thr Leu Ala Val Ser Lys Asn             20       #            25 (2) INFORMATION FOR SEQ ID NO: 14:      (i) SEQUENCE CHARACTERISTICS:           (A) LENGTH: 20 amino  #acids           (B) TYPE: amino acid           (D) TOPOLOGY: linear     (ii) MOLECULE TYPE: peptide     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:  #14: Ile Asn His Gln Val Val Pro Thr Leu Ala Va #l Ser Lys Asn Lys Ala 1               5    #                10   #                15 Leu Gln Ala Ile             20 (2) INFORMATION FOR SEQ ID NO: 15:      (i) SEQUENCE CHARACTERISTICS:           (A) LENGTH: 17 amino  #acids           (B) TYPE: amino acid           (D) TOPOLOGY: linear     (ii) MOLECULE TYPE: peptide     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:  #15: Ile Asn His Gln Val Val Pro Thr Leu Ala Va #l Ser Lys Asn Lys Ala 1               5    #                10   #                15 Leu (2) INFORMATION FOR SEQ ID NO: 16:      (i) SEQUENCE CHARACTERISTICS:           (A) LENGTH: 37 amino  #acids           (B) TYPE: amino acid           (D) TOPOLOGY: linear     (ii) MOLECULE TYPE: peptide     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:  #16: Thr Leu Ala Val Ser Lys Asn Lys Ala Leu Gl #n Ala Ile Glu Leu Gln 1               5    #                10   #                15 Leu Thr Leu Glu Thr Ile Tyr Asn Ser Gln Ty #r Ser Asn Glu Lys Trp             20       #            25       #            30 Thr Leu Gln Asp Val         35 (2) INFORMATION FOR SEQ ID NO: 17:      (i) SEQUENCE CHARACTERISTICS:           (A) LENGTH: 25 amino  #acids           (B) TYPE: amino acid           (D) TOPOLOGY: linear     (ii) MOLECULE TYPE: peptide     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:  #17: Gln Leu Thr Leu Glu Thr Ile Tyr Asn Ser Gl #n Tyr Ser Asn Glu Lys 1               5    #                10   #                15 Trp Thr Leu Gln Asp Val Ser Leu Glu             20       #            25 (2) INFORMATION FOR SEQ ID NO: 18:      (i) SEQUENCE CHARACTERISTICS:           (A) LENGTH: 14 amino  #acids           (B) TYPE: amino acid           (D) TOPOLOGY: linear     (ii) MOLECULE TYPE: peptide     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:  #18: Thr Leu Glu Thr Ile Tyr Asn Ser Gln Tyr Se #r Asn Glu Lys 1               5    #                10 (2) INFORMATION FOR SEQ ID NO: 19:      (i) SEQUENCE CHARACTERISTICS:           (A) LENGTH: 15 amino  #acids           (B) TYPE: amino acid           (D) TOPOLOGY: linear     (ii) MOLECULE TYPE: peptide     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:  #19: Thr Ser Val Phe Ser Ser Asn Glu Val Ser Se #r Pro Glu Ile Ile 1               5    #                10   #                15 (2) INFORMATION FOR SEQ ID NO: 20:      (i) SEQUENCE CHARACTERISTICS:           (A) LENGTH: 36 amino  #acids           (B) TYPE: amino acid           (D) TOPOLOGY: linear     (ii) MOLECULE TYPE: peptide     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:  #20: Val Phe Ser Ser Asn Glu Val Ser Ser Pro Gl #u Ile Ile Arg Gln His 1               5    #                10   #                15 Leu Ala Asn His Pro Ala Ala Thr His Thr Ly #s Ala Val Ala Leu Gly             20       #            25       #            30 Thr Glu Glu Thr         35 (2) INFORMATION FOR SEQ ID NO: 21:      (i) SEQUENCE CHARACTERISTICS:           (A) LENGTH: 37 amino  #acids           (B) TYPE: amino acid           (D) TOPOLOGY: linear     (ii) MOLECULE TYPE: peptide     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:  #21: Glu Ile Ile Arg Gln His Leu Ala Asn His Pr #o Ala Ala Thr His Thr 1               5    #                10   #                15 Lys Ala Val Ala Leu Gly Thr Glu Glu Thr Gl #n Thr Thr Ile Gln Arg             20       #            25       #            30 Pro Arg Ser Glu Pro         35 (2) INFORMATION FOR SEQ ID NO: 22:      (i) SEQUENCE CHARACTERISTICS:           (A) LENGTH: 19 amino  #acids           (B) TYPE: amino acid           (D) TOPOLOGY: linear     (ii) MOLECULE TYPE: peptide     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:  #22: Thr Glu Glu Thr Gln Thr Thr Ile Gln Arg Pr #o Arg Ser Glu Pro Asp 1               5    #                10   #                15 Thr Gly Asn (2) INFORMATION FOR SEQ ID NO: 23:      (i) SEQUENCE CHARACTERISTICS:           (A) LENGTH: 10 amino  #acids           (B) TYPE: amino acid           (D) TOPOLOGY: linear     (ii) MOLECULE TYPE: peptide     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:  #23: Glu Asp Leu Val Asp Phe Ile Val Asn Asp 1               5    #                10 (2) INFORMATION FOR SEQ ID NO: 24:      (i) SEQUENCE CHARACTERISTICS:           (A) LENGTH: 17 amino  #acids           (B) TYPE: amino acid           (D) TOPOLOGY: linear     (ii) MOLECULE TYPE: peptide     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:  #24: Ser Thr Gly Ser Lys Thr Lys Val Phe Gly Se #r Pro Leu Ser Asp Ile 1               5    #                10   #                15 Ser (2) INFORMATION FOR SEQ ID NO: 25:      (i) SEQUENCE CHARACTERISTICS:           (A) LENGTH: 51 base  #pairs           (B) TYPE: nucleic acid           (C) STRANDEDNESS: double           (D) TOPOLOGY: linear     (ii) MOLECULE TYPE: DNA (genomic)     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:  #25: TTGTCATGGG ATCTGAATTC CGGGGGGATC CGTCGACCTG CAGCCAAGCT T  #             51 (2) INFORMATION FOR SEQ ID NO: 26:      (i) SEQUENCE CHARACTERISTICS:           (A) LENGTH: 17 amino  #acids           (B) TYPE: amino acid           (D) TOPOLOGY: linear     (ii) MOLECULE TYPE: peptide     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:  #26: Leu Ser Trp Asp Leu Asn Ser Gly Gly Ile Ar #g Arg Pro Ala Ala Lys 1               5    #                10   #                15 Leu (2) INFORMATION FOR SEQ ID NO: 27:      (i) SEQUENCE CHARACTERISTICS:           (A) LENGTH: 42 base  #pairs           (B) TYPE: nucleic acid           (C) STRANDEDNESS: double           (D) TOPOLOGY: linear     (ii) MOLECULE TYPE: DNA (genomic)         (xi) SEQUENCE DESCRIPTION: SEQ  #ID NO: 27: TTGTCATGGG ATCTGAATTC CGGGGGGATC CGTCGACCTG CA     #                   #  42 (2) INFORMATION FOR SEQ ID NO: 28:      (i) SEQUENCE CHARACTERISTICS:           (A) LENGTH: 14 amino  #acids           (B) TYPE: amino acid           (D) TOPOLOGY: linear     (ii) MOLECULE TYPE: peptide     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:  #28: Leu Ser Trp Asp Leu Asn Ser Gly Gly Ile Ar #g Arg Pro Ala 1               5    #                10 (2) INFORMATION FOR SEQ ID NO: 29:      (i) SEQUENCE CHARACTERISTICS:           (A) LENGTH: 33 base  #pairs           (B) TYPE: nucleic acid           (C) STRANDEDNESS: double           (D) TOPOLOGY: linear     (ii) MOLECULE TYPE: DNA (genomic)     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:  #29: GGTACCAATG GGGAAGAGGG TACGGGATGT AAT        #                   #         33 (2) INFORMATION FOR SEQ ID NO: 30:      (i) SEQUENCE CHARACTERISTICS:           (A) LENGTH: 12 amino  #acids           (B) TYPE: amino acid           (D) TOPOLOGY: linear     (ii) MOLECULE TYPE: peptide     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:  #30: Ala Gly Thr Asn Gly Glu Glu Gly Thr Gly Cy #s Asn 1               5    #                10 (2) INFORMATION FOR SEQ ID NO: 31:      (i) SEQUENCE CHARACTERISTICS:           (A) LENGTH: 75 base  #pairs           (B) TYPE: nucleic acid           (C) STRANDEDNESS: double           (D) TOPOLOGY: linear     (ii) MOLECULE TYPE: DNA (genomic)     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:  #31: TTGTCATGGG ATCTGAATTC CGGGGGGATC CGTCGACCTG CAGGTACCAA TG #GGGAAGAG     60 GGTACGGGAT GTAAT               #                   #                   #    75 (2) INFORMATION FOR SEQ ID NO: 32:      (i) SEQUENCE CHARACTERISTICS:           (A) LENGTH: 25 amino  #acids           (B) TYPE: amino acid           (D) TOPOLOGY: linear     (ii) MOLECULE TYPE: peptide     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:  #32: Leu Ser Trp Asp Leu Asn Ser Gly Gly Ile Ar #g Arg Pro Ala Gly Thr 1               5    #                10   #                15 Asn Gly Glu Glu Gly Thr Gly Cys Asn             20       #            25 (2) INFORMATION FOR SEQ ID NO: 33:      (i) SEQUENCE CHARACTERISTICS:           (A) LENGTH: 26 base  #pairs           (B) TYPE: nucleic acid           (C) STRANDEDNESS: double           (D) TOPOLOGY: linear     (ii) MOLECULE TYPE: DNA (genomic)     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:  #33: TTGTCATGGG ATCTGAATTC CGGGGG           #                   #              26 (2) INFORMATION FOR SEQ ID NO: 34:      (i) SEQUENCE CHARACTERISTICS:           (A) LENGTH: 10 amino  #acids           (B) TYPE: amino acid           (D) TOPOLOGY: linear     (ii) MOLECULE TYPE: peptide     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:  #34: Leu Ser Trp Asp Leu Asn Ser Gly Gly Ile 1               5    #                10 (2) INFORMATION FOR SEQ ID NO: 35:      (i) SEQUENCE CHARACTERISTICS:           (A) LENGTH: 30 base  #pairs           (B) TYPE: nucleic acid           (C) STRANDEDNESS: double           (D) TOPOLOGY: linear     (ii) MOLECULE TYPE: DNA (genomic)     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:  #35: TTGTCATGGG ATCTGAATTC CGGGGGGATC          #                   #           30 (2) INFORMATION FOR SEQ ID NO: 36:      (i) SEQUENCE CHARACTERISTICS:           (A) LENGTH: 10 amino  #acids           (B) TYPE: amino acid           (D) TOPOLOGY: linear     (ii) MOLECULE TYPE: peptide     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:  #36: Leu Ser Trp Asp Leu Asn Ser Gly Gly Ile 1               5    #                10 (2) INFORMATION FOR SEQ ID NO: 37:      (i) SEQUENCE CHARACTERISTICS:           (A) LENGTH: 27 base  #pairs           (B) TYPE: nucleic acid           (C) STRANDEDNESS: double           (D) TOPOLOGY: linear     (ii) MOLECULE TYPE: DNA (genomic)     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:  #37: ACTCTTTGCC AACGTTTAAA TGTGTGT           #                   #             27 (2) INFORMATION FOR SEQ ID NO: 38:      (i) SEQUENCE CHARACTERISTICS:           (A) LENGTH: 9 amino  #acids           (B) TYPE: amino acid           (D) TOPOLOGY: linear     (ii) MOLECULE TYPE: peptide     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:  #38: Thr Leu Cys Gln Arg Leu Asn Val Cys 1               5 (2) INFORMATION FOR SEQ ID NO: 39:      (i) SEQUENCE CHARACTERISTICS:           (A) LENGTH: 57 base  #pairs           (B) TYPE: nucleic acid           (C) STRANDEDNESS: double           (D) TOPOLOGY: linear     (ii) MOLECULE TYPE: DNA (genomic)     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:  #39: TTGTCATGGG ATCTGAATTC CGGGGGGATC ACTCTTTGCC AACGTTTAAA TG #TGTGT        57 (2) INFORMATION FOR SEQ ID NO: 40:      (i) SEQUENCE CHARACTERISTICS:           (A) LENGTH: 19 amino  #acids           (B) TYPE: amino acid           (D) TOPOLOGY: linear     (ii) MOLECULE TYPE: peptide     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:  #40: Leu Ser Trp Asp Leu Asn Ser Gly Gly Ile Th #r Leu Cys Gln Arg Leu 1               5    #                10   #                15 Asn Val Cys (2) INFORMATION FOR SEQ ID NO: 41:      (i) SEQUENCE CHARACTERISTICS:           (A) LENGTH: 30 base  #pairs           (B) TYPE: nucleic acid           (C) STRANDEDNESS: double           (D) TOPOLOGY: linear     (ii) MOLECULE TYPE: DNA (genomic)     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:  #41: GACTATTATG GTTTATATTA TGTTCATGAA          #                   #           30 (2) INFORMATION FOR SEQ ID NO: 42:      (i) SEQUENCE CHARACTERISTICS:           (A) LENGTH: 10 amino  #acids           (B) TYPE: amino acid           (D) TOPOLOGY: linear     (ii) MOLECULE TYPE: peptide     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:  #42: Asp Tyr Tyr Gly Leu Tyr Tyr Val His Glu 1               5    #                10 (2) INFORMATION FOR SEQ ID NO: 43:      (i) SEQUENCE CHARACTERISTICS:           (A) LENGTH: 60 base  #pairs           (B) TYPE: nucleic acid           (C) STRANDEDNESS: double           (D) TOPOLOGY: linear     (ii) MOLECULE TYPE: DNA (genomic)     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:  #43: TTGTCATGGG ATCTGAATTC CGGGGGGATC GACTATTATG GTTTATATTA TG #TTCATGAA     60 (2) INFORMATION FOR SEQ ID NO: 44:      (i) SEQUENCE CHARACTERISTICS:           (A) LENGTH: 20 amino  #acids           (B) TYPE: amino acid           (D) TOPOLOGY: linear     (ii) MOLECULE TYPE: peptide     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:  #44: Leu Ser Trp Asp Leu Asn Ser Gly Gly Ile As #p Tyr Tyr Gly Leu Tyr 1               5    #                10   #                15 Tyr Val His Glu             20 

What is claimed is:
 1. An isolated or purified antibody that binds to an amino acid sequence present in SEQ ID NO:1, SEQ ID NO:2, or SEQ ID NO:4.
 2. The antibody of claim 1, wherein the antibody detects or identifies the E1 protein of human papilloma virus 16 (HPV 16).
 3. The antibody of claim 1, which is a monoclonal antibody.
 4. The antibody of claim 1, which is a polyclonal antibody.
 5. A composition comprising the antibody of claim 1, and an antibody binding buffer.
 6. The composition of claim 5, which is a diagnostic composition.
 7. The composition of claim 5, wherein the composition is used for identifying, diagnosing, or detecting HPV 16 infection.
 8. A method of identifying, diagnosing, or detecting HPV 16 infection, said method comprising contacting a sample from a human suspected of being infected with HPV 16 with an isolated or purified antibody that binds to an amino acid sequence present in SEQ ID NO:1, SEQ ID NO:2, or SEQ ID NO:4, and detecting binding of said antibody to said amino acid sequence, wherein binding indicates an HPV 16 infection.
 9. The method of claim 8, wherein said detecting is by enzyme linked immunosorbent assay (ELISA).
 10. The method of claim 8, wherein said method detects binding of said antibody with the E1 protein of HPV
 16. 